An Effective Probabilistic Skyline Query Process on Uncertain Data Streams

نویسندگان

  • Chuan-Ming Liu
  • Syuan-Wei Tang
چکیده

With the evolution of technology, the ways to acquire data and the applications of data are more diverse. As data volume continuously grows, the data quality may not be high as usual. The data can be defected, imprecise or inaccurate due to the process of data acquiring. Recently, the skyline query is widely used in data analysis to derive the results that meets more than one specific condition simultaneously. For example, the forest monitoring system, which collects the temperature and humidity of the surrounding environment with sensors, to monitor the disasters. Using the skyline query, the zones of potential fire hazards can be found in time, where the temperature is high and the humidity is low. In the mentioned application, the derived data change with time. We refer to such data as data streams. The constant change and uncertainty of data make the query process difficult and need more computations. Thus, how to have an effective skyline query process in terms of time and space over uncertain data streams becomes crucial. In this paper, we discuss this problem and propose an effective approach, Efficient Probabilistic Skyline Update (EPSU), using a new data structure by augmenting the R-tree structure. The relevant algorithms are analyzed and discussed. Last, we perform the simulated experiments extensively with synthetic data to validate the EPSU approach. The results show that EPSU can effectively compute the probabilistic skyline query in terms of the time and space and outperforms the existing ones. c © 2014 The Authors. Published by Elsevier B.V. Peer-review under responsibility of the Program Chairs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

PhD Thesis Efficiently and Effectively Processing Probabilistic Queries on Uncertain Data Candidate

Uncertainty is inherent in many real applications. Uncertain data analysis and query processing has become a critical issue and has attracted a great deal of attention in database research community recently. The thesis, therefore, targets an important and challenging topic uncertain data management. It is a high quality and well-written PhD thesis. Five important and related aspects of uncerta...

متن کامل

Probabilistic Skyline Queries over Uncertain Moving Objects

Data uncertainty inherently exists in a large number of applications due to factors such as limitations of measuring equipments, update delay, and network bandwidth. Recently, modeling and querying uncertain data have attracted considerable attention from the database community. However, how to perform advanced analysis on uncertain data remains an interesting question. In this paper, we focus ...

متن کامل

Continuous Probabilistic Skyline Queries over Uncertain Data Streams

Recently, some approaches of finding probabilistic skylines on uncertain data have been proposed. In these approaches, a data object is composed of instances, each associated with a probability. The probabilistic skyline is then defined as a set of non-dominated objects with probabilities exceeding or equaling a given threshold. In many applications, data are generated as a form of continuous d...

متن کامل

Efficient Query Processing Techniques in Uncertain Databases

Query processing on uncertain data has become increasingly important in many real-world applications. In this paper, we present our works on formulating and tackling three important queries in uncertain databases, that is, probabilistic group nearest neighbor (PGNN), probabilistic reverse skyline (PRSQ), and probabilistic reverse nearest neighbor (PRNN) queries.

متن کامل

Reporting l most influential objects in uncertain databases based on probabilistic reverse top-k queries

Reverse topk queries are proposed from the perspective of a product manufacturer, which are essential for manufacturers to assess the potential market. However, the existing approaches for reverse topk queries are all based on the assumption that the underlying data are exact (or certain). Due to the intrinsic differences between uncertain and certain data, these methods cannot be applied to pr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015